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The  Modified  Cram4r*von  Mises  Goodness-of-Fit  Criterion 

for  Time  Series 

T.  W.  Anderson  and  M.  A.  Stephens 


1.  Introduction 

P.  C.  Mahalanobis  may  be  best  known  among  mathematical  statisticians  for  the 
‘‘Mahalanobis  distance,”  which  in  one  form  is*  (/ij  — —/ij),  where  /ij  and 
ft2  are  the  vectors  of  means  in  two  populations  or  distributions  and  27  is  a  covariance 
matrix  common  to  the  two.  It  is  considered  a  measure  of  the  difference  between  the 
two  distributions.  When  27  is  known  and  /ii  and  are  unknown  and  estimated 
by  sample  means  scj  and  *2)  respectively,  (xj  —  X2y27”*(xi  —  X2),  also  known  as  a 
Mahalanobis  distance,  is  an  estimator  of  the  first  form.  If  Xj  and  X2  are  normally 
distributed  and  hence  Xj  —  X2  is  normally  distributed,  the  Mahalanobis  distance  is  a 
quadratic  form  in  normal  variables.  When  Xi  and  X2  have  covariance  matrices  27  and 
fti  =  /t],  the  Mahalanobis  distance  is  proportional  to  a  variable  with  number  of 
degrees  of  freedom  equal  to  the  number  of  components  of  /ij ,  Xi,  and  X2.  Another 

Mahalanobis  distance  is  (xi  —  fijy£~*(xi  —  /^),  which  is  appropriate  if  an  unknown 
/ij  is  estimated  by  a  sample  mean  Xi. 

In  this  paper  we  consider  somewhat  analogous  quadratic  forms  in  normal  variables 
when  the  dimensionality  is  infinite.  Then  the  quadratic  forms  are  distributed  as 
infinite  weighted  sums  of  x^-variables.  These  come  about  as  goodness-of-fit  criteria 
for  a  hypothesis  that  a  cumulative  distribution  function  is  a  specified  one  or  that  two 
cdf ’s  are  the  same.  Such  criteria  also  arise  for  goodness-of-fit  tests  for  standardized 
spectral  distributions. 

As  examples,  we  give  tables  of  the  distribution  of  the  criterion  for  testing  the 
hypothesis  that  a  stationary  stochastic  process  is  a  given  moving  average  process  of 

*The  quadratic  form  should  be  called  “Mahalanobis  distance  squared”,  but  the  third  word  is 
usually  omitted. 
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order  1  and  for  testing  the  hypothesis  that  it  is  a  specified  autoregressive  process  of 
order  1.  Two  methods  are  described  for  calculating  the  distribution.  Either  method 
is  appropriate  for  calculating  the  distribution  of  the  criterion  for  testing  the  hypoth¬ 
esis  that  a  process  is  a  stationary  process  whose  standardized  spectral  density  or 
distribution  is  a  specified  one. 

Test  for  a  given  probability  distribution.  Let  Xi, •••,£„  be  n  observations  from 
a  distribution  with  cdf  F(x)  and  let  the  empirical  distribution  of  the  sample  be 
f'n(x)  =  k/n  if  k  of  the  x,’s  are  not  greater  than  x.  To  test  the  null  hypothesis 
F(x)  =  Fo(x),  where  /b(x)  is  specified,  the  Cramer-von  Mises  statistic  is 

(1)  =  n  r  |F.(x)  -  Fo(x)l’iF„(x). 

«/— oo 

When  Fo(x)  is  continuous  and  F{x)  =  Fo(a:)»  N/w[^n(a;)  —  ^o(ic)]»  considered  as  a 
stochastic  process  with  time  parameter  x,  converges  weakly  to  a  Gaussian  process  with 
covariance  function  min[Fo(x),  Fo(y)]--Fo(x)Fo(y),  and  has  a  limiting  distribution 
as  a  quadratic  functional  of  the  process.  If  we  make  the  monotonic  tansformation 
u  =  Fo(x)  and  u,  =  ?  n,  then  the  Ui  can  be  considered  as  observations 

from  the  uniform  distribution  on  [0, 1].  Let  Hn{u)  =  F„[F“*(u)].  Then  y/n[Hn{u)  —  v.] 
converges  weakly  to  a  Gaussian  process,  say  J(u),  with  covariance  function 

(2)  SJ{u)J{v)  =  h{u,  v)  =  min(u,  v)  —  uv. 

In  fact,  the  limiting  process  /(u)  is  the  Brownian  bridge. 

Suppose  Fi(x)  and  ^2(2?)  are  two  cdf’s,  and  and  F„,(x)  are  the  empirical 

cdf's  of  samples  of  size  ni  and  n2  obtained  from  the  two  populations,  respectively. 
Then 

is  a  criterion  for  testing  the  hypothesis  Fi(x)  =  F2{x).  When  the  hypothesis  is  true, 

(3)  has  the  same  limiting  distribution  as  (1).  The  statistic  (3)  might  be  considered 

as  analogous  to  the  Mahalanobis  distance  (xi  —  ~  ^2))  where  S  is  an 

estimator  of  S. 

Test  for  a  given  spectral  distribution.  Cramer-von  Mises  tests  can  be  given  for 
testing  hypotheses  about  spectral  distributions  of  stationary  stochastic  processes.  In 
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many  situations  questions  arise  about  the  pattern  of  dependence;  these  questions 
pertain  to  the  autocorrelations  or  equivalently  to  the  Fourier  transform  of  the  auto¬ 
correlations,  which  we  term  the  standardized  spectral  density. 

Consider  a  stationary  stochastic  process  {y<},  t  =  •  •  • ,  —1, 0, 1,  •  •  •  with  Syt  =  0, 
autocovariance  function  Sytyt-^h  =  ^  autocorrelation 

function  ph.  =  h  =  •  •  • ,  —1,0, 1,  •  •  • .  We  define  the  standardized  spectral 

density  as 

(4)  /(A)  =  ^  53  ^fcCosAh,  -t<A<x. 

h=-<X) 

Note  that  the  coefficients  of  the  trigonometric  functions  are  the  autocorrelations,  not 
the  autocovariances.  The  Fourier  transform  of  the  standardized  spectral  density  is 


(5)  /  /(■^)coS‘^*»  h  =  •••,— 1,0, I*--. 

Knowledge  of  the  standardized  spectral  density  is  equivalent  to  knowledge  of  the  au¬ 
tocorrelations.  Since  /(A)  =  /(—A),  we  define  the  standardized  spectral  distribution 
as 

(6)  f  (X)  =  2  jf '  /(»)i(»)  =  i  (a  +  2  £  . 

Note  that  F{t)  =  1;  the  standardized  spectral  distribution  has  the  properties  (non¬ 
negative  increments)  of  a  probability  distribution  on  [0,7r]. 

If  the  sample  is  j/i,  *  ■  • ,  yr,  the  sample  autocovariance  sequence  may  be  defined  as 

j  T-h 

(7)  Cfc  =  c_fc  =  —  —  53  iftyt+ky  h  =  0, 1,  •  •  • ,  r  1, 

^  ^  1=1 


Note  that  c^  here  is  an  unbiased  estimator  of  The  reason  for  using  the  unbiased 
estimator  instead  of  the  biased  estimator  (that  is,  yiyt+k/7’  as  in  Anderson 
(1993))  is  that  then  the  asymptotic  theory  serves  as  a  better  approximation  to  the 
small  sample  behavior.  The  sample  autocorrelation  sequence  is  defined  as  r*  =  r_fc  = 
Ck/co,  h  =  0, 1,  ■  •  • ,  r  —  1;  the  standardized  sample  spectral  density  is 


(8) 


/r(A)  = 


27rcto 


Eyie 


,tA( 


iszl 


1 

—  51  TfcCosAh,  — ir  <  A  <  tt; 

2’*’  h=-(T-l) 
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and  the  standardized  sample  spectral  distribution  function  is 


(9)  Ft(X)  =  2  jf" lT{v)dv  =  i  (a  +  2g . 

To  test  the  hypothesis  that  F{x)  =  Fo(x),  where  Fo(x)  is  completely  specified, 
one  can  use  the  Cramer*von  Mises  statistic 


Consider  the  discrepancy 


/  ,  \  i^r  FI  / ,  \  F^/ ,  M  2y/T  ^  sin  Ah ,  .  2\/T  ^  sin  Ah 

n  Ft  A  -  f  (A  =  -^  E  -  m)  -  E 

”  tA  ^  ^  k.T  >' 


as  a  stochastic  process  over  [0,ir].  The  monotonic  transformation  u  =  6^(A)/G(r), 
where 

(12)  G(A)  =  2 /"/’(W-lo. 

JO 

can  be  made  from  A  to  u  to  obtain  a  process  on  [0, 1].  Then  (11)  is  transformed  to 


(13)  Yt(u)  =  -Jf  [FT(G-‘lG(!r)«l)  -  (G(f)«1}]  , 

and  lrj(u)/[2^FG(F)]  converges  weakly  to  the  Gaussian  process  with  covariance  func- 
tion 

(14)  h(u,  v)  =  h(u,  v)  +  g('a)9(v), 


where 

(15)  ,(u)  =  «  -  F(G-'lG(F)n]), 

and  h(u,v)  is  given  by  (2).  If  B(u)  is  the  Brownian  bridge  [that  is,  £B{u)  =  0  and 
SB{u)B{y)  =  h(«,  u)],  then  the  limiting  process  above  is  B(u)  +  q{u)X,  where  X  has 
the  standard  normal  distribution  N{0,1). 

Test  for  a  probability  distribution:  composite  hypothesis.  Suppose  {/'(x|d)}  is  a 
family  of  probability  cdf’s  indexed  by  the  scalar  parameter  9.  If  0  is  an  estimator  of 
6,  then 

(16)  n  r  [F,(i)  -  f  (l|«)lVF(i|«) 

J—oo 
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can  be  used  to  test  the  null  hypothesis  that  the  cdf  sampled  belongs  to  the  class 
{F(x|<l)}.  Let  the  density  of  F(x|fl)  be  /(x|fl);  let 


(17) 


'aiog/(x|g)l^  _ 

dB  ' 


and  let 


m(u)  = 


aiogFlF-'(l|«)l*l 


If  0  is  an  efficient  estimator  such  as  the  maximum  likelihood  estimator,  the  pro¬ 
cess  v^{F„[F“*(<|d)]  —  F[F~*(t|0)|5]}  converges  weakly  to  a  Gaussian  process  with 
covariance  matrix  h{u,v)  —  m{u)m{v)fX. 

In  this  paper  we  are  interested  in  finding  numerically  the  distributions  of  the  lim¬ 
iting  Cramer- von  Mises  statistic  when  the  covariance  function  has  the  form  h{u,  v)  ± 
r(u)r(t;).  In  the  case  of  a  standardized  spectral  distribution  the  covariance  function 
has  this  form  with  r(u)  =  q(u)  and  the  -|-  sign;  in  the  case  of  probability  distri¬ 
butions  with  one  estimated  parameter  the  covariance  function  has  this  form  with 
r(u)  =  m(u)/\/l  and  the  —  sign. 


2.  Use  of  integral  equations 

First  we  sketch  the  general  theory  of  finding  the  distribution  of  a  quadratic 
integral  of  a  continuous  time  parameter  Gaussian  stochastic  process  with  a  given  co- 
variance  function  by  the  example  of  the  Cramer- von  Mises  criterion  (1).  Any  function 
0  <  u,t;  <  1,  that  is  symmetric  in  u  and  v,  continuous,  and  square  integrable 
(in  one  and  both  variables)  can  be  expressed  as 

(19)  fc(u,t;)  =  ■^/i(«)/>(t')» 

>=i  ''i 

where  Xj  is  an  eigenvalue  and  /j(u)  the  corresponding  normalized  eigenfunction  of 
the  integral  equation 

(20)  f{u)  =  xfk{u,v)f{v)dv 

Jo 

and 

(21)  [  fi{u)fj{u)du  =  Sij, 

Jo 
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where  Su  =  1  and  Sij  =  0,  i  ^  j.  If  A:(0,0)  =  A:(l,  1)  =  0,  then  /,(0)  =  /<(!)  =  0.  If 
ib(u,  v)  is  the  covariance  function  of  a  stochastic  process  Z{u),  0  <  ti  <  1,  then  k{u,  v) 
is  positive  definite  and  Aj  >  0.  The  series  (19)  converges  absolutely  and  uniformly  in 
the  unit  square.  If  Z{u)  is  Gaussian  and  SZ(u)  =  0,  then  it  has  the  representation 

(22)  Z(u)  =  g 

i=i 

where  Xi,X2, •••  are  independent  A^(0, 1)  variables.  With  probability  1, 

(23)  /'  Z\u)du  =  /'  f; -j=XiMu)-LxiMu)du  =  f;  yX], 

Jo  Jo  j_|  \/Ai  jsl 

The  characteristic  function  of  (23)  is 


5  exp 


The  function  D{X)  =  n^i(l  “  ^/^j)  Is  known  as  the  Fredholm  determinant  of  the 
integral  equation  (20). 

The  Brownian  bridge  with  covariance  function  /i(u,  v)  =  min(u,  v)  —  uv  has  the 
representation  (22)  with  A,  =  and  fj{u)  =  \/2sinj?ru.  The  characteristic  func¬ 
tion  (24)  is  (sin  v^if/V^)"^  [Anderson  and  Darling  (1952)]. 

Now  consider  k{u,  u)  =  /»(u,  v)  -|-  q{u)q{v).  We  define  the  Fourier  coefficients 

(25)  Qj  =  q{u)fj{u)du 

2y/2  r  .  [.  G(A)'j  fG(A)  f2(\\j\ 

=  GW  y.  row]  [gw  '  ^ 

Then  ^(u)  =  The  process  B{u)  -j-  Xq{u)  has  the  representation 


B(u)  +  Xq(u)  =  g  j  /,(«), 


and  the  Cramer- von  Mises  criterion  has  the  representation 


S  =  f^[B{u)-\-Xq{u)fdu 
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Alternatively,  there  is  a  representation 

(28)  k(u,  n)  =  5]  ^9jiu)gj{v), 

j=i 

where  k{u,  v)  is  given  by  (14)  and  t/j  and  gj{u)  are  the  eigenvalues  and  eigenfunctions 
associated  with  the  integral  equation 

(29)  g(u)  =  i//  k{u,v)g(v)dv. 

Jo 

The  Fredholm  determinant  of  (29)  is 

(30)  jP*(t/)  =  f  I  c{uyt\v)q{u,)q{t)dudt  — V  f  g*(u)du}, 

y/U  Jo  Jo  Jo 

where 

sm(v^u) s\n[y/v{t  -  1)],  u  <  f , 

sin[Vi7(«  -  1)]  sin(v^<),  u>t. 

The  eigenvalues  of  k{u,v)  are  the  values  of  i/  that  satisfy  /?*(*')  =  0.  The 

function  c(«,  t\  v)  is  the  resolvent  or  resolving  kernel  of  the  kernel  min(u,  i)  —  ut.  The 
characteristic  function  of  5  is  [Zl*(tt)]'». 

\xtXjlJXj  +  aiX  =  Yi,  i  =  1, 2,  •  •  • .  Then  (27)  is 

(32)  S=^Yl. 

J=l 

The  Yj  are  normally  distributed  with  SVj  =  0,  SVJ^  =  1/Aj  +  aj,  and  SViYj  = 
QiOj,  i  ^  j.  Then  S  can  be  approximated  by  a  finite  sum  Sat  =  YJ^.  The 
difference  S  —  Sn  has  expectation 

(33)  f(5-5;v)  =  £  Z  f  ff +  «?), 

j=N+l  \''i  / 
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which  can  be  made  arbitrarily  small  by  taking  N  sufficiently  large.  Hence,  as  N  —* 
oo,  the  distribution  of  Ss  converges  to  the  distribution  of  S  and  the  characteristic 
function  of  Sn  approaches  the  characteristic  function  of  5. 

Let  Yn  be  the  A^-vector  with  Yj  as  the  jth  component.  The  covariance  matrix 
of  Yfi  is  SYsYf^  =z  +  Qnq'js/,  where  is  the  diagonal  matrix  with  l/Xj  as  the 
jth  diagonal  element  and  ajv  is  the  vector  with  aj  as  the  jth  component.  Then  the 
characteristic  function  of  Sn  is 

(34)  =  \In  -  2iti\N  +  =  11(1  -  2it<f>jN)-^f\ 

i=i 

where  <f>jN  {<I>in  >  <I>2S  >  •  *  •  >  4>nn)  is  the  jth  characteristic  root  of  An  +  oino^n^ 
that  is,  the  jth  zero  of 


(35)  |Ayv  +  ocnq'n  ~  =  |Aa^  -  ^/jv|0(^), 

where 

(36)  = 1 + 1: 

i=i  A>  ^ 

for  ^  ^  ^/Xj.  If  Oj  0  (as  is  the  case  for  the  MA(1)  and  AR(1)  tests  tabulated 
below),  1/Xj  is  not  a  zero  of  (35).  Let 


(37) 


KM 


|/iv  —  v{An  +  Qt/vajv)! 


Since  °°  ~  So  9*(«)<^«  <  oOj^)v(*')  converges  to  D*{i/),  the 

Fredholm  determinant,  as  N  —*  oo.  The  characteristic  function  of  S  is  l/y/D-(2it). 

When  the  explicit  form  (30)  of  the  Fredholm  determinant  is  intractable  or  cannot 
be  inverted,  it  can  be  approximated  by  D^{v)  given  by  (37).  For  this  we  turn  to  the 
numerical  evaluation  of  the  ^jn,  the  characteristic  roots  of  An  +  and  the  zeros 

of  (35).  These  approximate  the  reciprocals  of  the  first  N  eigenvalues  of  k{u,v).  The 
first  derivative  of  V’(^)  is 


(38) 


E 


j=l 


>0. 
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As  ^  ♦  1/Aj  from  above,  — »  — c»,  and  as  ^  »  1/Aj  from  below,  oo. 

Since  0(^)  is  continuous  except  at  ^  3  —  1?  •  •  ■ » there  is  one  root  in  the 

interval  (1/Aj,  1/Aj_i),  3  =  1, •  •  •  ,7V,  where 

(39)  i/ao  =  i;q;. 

j=i 

3.  Calcuiation  of  the  distributions  of  test  statis¬ 
tics 

In  this  section  the  theory  given  above  is  employed  to  provide  tests  for  two  time 
series  models,  the  MA(1)  and  AR(1)  processes.  Asymptotic  points  are  given  with  the 
tests;  the  procedure  for  calculating  these  points  follows  closely  the  methods  employed 
by  Stephens  (1976)  for  obtaining  asymptotic  points  for  goodness-of-fit  statistics  in 
connection  with  tests  for  distributions.  We  first  find  approximations  to  the  Vj,  or 
more  precisely,  the  values  Uj  =  \fvj,  in  the  representation  (28).  These  are  the  zeros 
<i>  of  (36).  They  were  found  by  two  methods,  one  used  to  check  the  other,  and  in  each 
case  asymptotic  points  were  calculated. 

Method  of  Fourier  coefficients.  Recall  that  when  the  covariance  of  the  Gaus¬ 
sian  process  is  h{u,v)  in  (2),  the  eigenvalues  are  Aj  =  and  the  eigenfunctions  are 
fj{u)  =  \/2sin(7rju).  la  order  to  apply  the  results  of  Section  2  above,  we  need  values 
aj  as  defined  by  (25),  where  the  functions  q{u)  depend  on  the  asymptotic  Gaussian 
process  for  the  time  series  model.  Anderson  (1993)  gave  explicitly  the  functions  F(A) 
and  G(A)  for  the  MA(1)  and  AR(1)  models.  The  values  of  aj  defined  by  the  second  ex¬ 
pression  in  (25)  were  calculated  by  numerical  integration  for  j  =  1,  •  •  • ,  40;  they  are  all 
nonzero,  but  rapidly  become  very  small  (of  the  order  of  10”^  for  3  =  40).  Note  that  if 
the  first  expression  in  (25)  is  approximated  by  \/2S"=o9(*7”)sin(i7r*7n)/n,  obtained 
by  straightforward  application  of  the  trapezoidal  rule,  the  Fast  Fourier  Transform  can 
be  used  to  obtain  values  of  Oj  for  many  3.  For  this  calculation,  q{u)  given  by  (15) 
must  be  found.  This  was  done  by  calculating  a  table  of  values  of  G{X)  at  intervals 
small  enough  to  obtain  a  first  approximation  to  G~^[G(7r)u]  by  linear  interpolation; 
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the  approximation  was  then  refined  by  Newton’s  method. 

The  zeros  of  (36)  can  be  found  by  search  since  (38)  indicates  that  ri>{4>)  is  always 
increasing  between  vertical  asymptotes  at  1/Aj.  The  Wj  for  j  =  41,..,,  100  were 
approximated  by  observing  that  —*  k,dL  constant  which  is  determined  from  the 
last  calculated  values. 

Method  of  discretization  With  modem  computing  resources  we  can  find  approx¬ 
imate  values  of  u)j  by  creating  a  matrix  K  with  entries  kij  =  Jk(ti„  Uj);  it(u,  v)  is  given 
by  (14)  and  m  =  i/(n  -|- 1)  and  Vj  =  j7(n  -f- 1),  for  i,j  =  1, . . .  ,n.  The  characteristic 
roots  and  vectors  of  K  then  approximate  and  the  corresponding  eigenfunctions 
gj{u)  of  (29).  This  computation  wjis  done  for  n  =  50, 100  juid  200, 

Calculation  of  asymptotic  points  When  the  set  of  n  values  of  Uj  is  found,  the 
distribution  of  S  given  by  (23)  or  (32)  is  approximated  by  the  distribution  of 

(40)  +c, 

j=i 

where  Xj  are  independent  standard  normals,  and  c  is  a  constant,  obtained  from  the 
fact  that 

(41)  €S  =  f  k{u,u)du  =  ST  =  +  c. 

The  integral  /J  A:(«,  u)du  =  1/6-1- q'^{u)du,  and  the  last  integral  is  found  numerically, 
thus  determining  c.  It  might  be  thought  that  c  should  be  replaced  by  a  random 
variable,  say  u;oo,  but  the  difference  in  ST  and  is  sufficiently  small  that  the 

variance  of  Wo©  is  negligible. 

Given  wi ,  •  •  • , a;„,  T  is  a  sum  of  c  and  a  weighted  sum  of  x^’s,  each  with  1  degree  of 
freedom.  Then  Pr{r  —  c  <  a;}  can  be  found  by  Imhof’s  method  (for  which  computer 
programs  are  available).  For  selected  values  of  the  probabilities  the  values  of  x  were 
computed  for  Tables  1  and  2.  It  was  found  that  when  x  for  a  given  probability  was 
plotted  against  1/n  (n  =  50, 100,200)  in  the  second  method  the  plot  was  linear  and 
hence  could  be  extrapolated  to  1/n  =  0  to  give  the  final  percentage  points  at  this 
probability.  The  values  obtained  in  this  way  agreed  to  those  by  the  first  method  to 
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4  decimal  places  in  the  upper  tail  (which  is  the  tail  used  for  significance  points)  and 
differed  by  only  1  in  the  fourth  decimal  place  in  the  lower  tail.  If  only  one  method  is 
to  be  used,  the  method  of  Fourier  coefficients  seems  to  be  preferable. 


4.  Test  for  the  moving  average  model  of  order  1 

For  this  model,  the  time  series  yt  is  given  by  yt  =  ut  +  aut-i,  where  the  u<  are 
uncorrelated  with  mean  zero  and  variance  Let  the  autocovariance  be  Sytyt+h  = 
the  autocorrelation  is  ph  =  Then  po  =  1,  Pi  =  P-i  =  p  =  oKl+a^), 

and  Pa  =  0  for  h  ^  —  1,  0  or  1.  The  test  below  is  for  a  moving  average  model  with  a 
(or  equivalently  p)  known.  Suppose  yi,y2,--*»yr  is  a  sample  of  T  successive  values 
of  yt.  The  sample  autocovariances  are  given  by  (7),  and  the  sample  autocorrelations 
are  r*  =  r_A  =  Ch/co,  h  =  0, 1, . . .  ,r  -  1. 


The  test  statistic  The  Cramer-von  Mises  criterion  for  a  time  series  process  yields 
a  test  statistic  with  computing  formula  (Anderson,  (1993)] 


(42) 


Q  = 


8ir*CP{ 


T  (T-1 


(^g  Pg)(Pr+g  Pr-g) 


I' 


where  G(')  depends  on  the  process.  In  calculating  Q  from  (11)  the  second  sum  on 
the  right-hjind  side  of  (11)  has  been  omitted  because  it  converges  to  zero.  For  the 
MA(1)  process,  G(5r)  =  (1  +  2p^)/(27r)  [Anderson,  (1993)].  Also  since  p*  takes  values 
only  for  h  =  —1,0  and  1,  the  formula  (41)  simplifies  considerably  for  this  process:  for 
a  given  r,  the  sum  over  g  cau  be  divided  into  two  sums  for  g  from  — r  —  1  to  — r  -|- 1 
and  for  g  from  r  —  1  to  r  +  1.  The  test  for  the  MA(1)  process  with  known  o,  that  is, 
with  known  p,  then  consists  of  the  following  steps: 


1.  Calculate  the  for  ^  =  1, . . . ,  T  —  1; 

2.  Calculate  Q; 


3.  Refer  Q  to  the  asymptotic  points  in  Table  1  for  the  corresponding  p.  Reject  the 
MA(1)  model  at  level  p  if  Q  exceeds  the  percentage  point  given  for  level  p. 


The  points  are  asymptotic,  but  extensive  Monte  Carlo  studies  indicate  that  they  can 
be  used  with  good  accuracy  for  T  >  50,  a  reasonable  length  for  most  time  series. 

5.  Test  for  the  autoregressive  model  of  order  1 

For  this  time  series,  yt  =  pyt-i  +  where  the  Ut  are  as  defined  above.  Then, 
Ph  =  P-h  =  P*,  and  G(jr)  =  (1  +  p^)/  {2t(1  —  p^)}.  Given  a  sample  of  length  T,  the 
test  for  AR(1)  with  known  p  follows  the  three  steps  given  in  the  previous  section,  but 
Q  is  now  referred  to  the  asymptotic  points  in  Table  2  for  the  corresponding  p.  Reject 
the  AR(1)  model  at  level  p  if  exceeds  the  percentage  point  given  for  level  p. 

Here  again,  the  asymptotic  points  will  be  adequate  for  T  >  50.  For  this  process, 
Q  is  more  complicated  to  calculate  and  it  is  important  to  keep  to  the  correct  limits  for 
both  r  and  g.  For  both  models,  a  FORTRAN  program  to  calculate  the  test  statistic 
and  also  to  look  up  the  appropriate  table,  is  available  from  the  second  author. 

The  authors  are  very  grateful  to  Dr.  Richard  Lockhart  for  valuable  assistance 
with  the  computational  procedures  in  this  paper. 


References 

Anderson,  T.  W.  (1993).  Goodness  of  fit  tests  for  spectral  distributions.  The  Annals 
of  Statistics  21,  830-847. 

Anderson,  T.  W.,  and  D.  A.  Darling  (1952).  Asymptotic  theory  of  certain  ‘goodness 
of  fit’  criteria  based  on  stochastic  processes.  Ann.  Math.  Statist.  23,  193-212. 

Stephens,  M.  A.  (1976).  Asymptotic  results  for  goodness-of-fit  statistics  with  un¬ 
known  parameters.  Annals  of  Statistics,  4,  357-369. 


12 


Table  1.:  Asymptotic  Points  For  MA(1)  Test 


p\p 

0.001 

0.005 

0.01 

0.025 

0.05 

0.1 

0.15 

0.25 

0.0 

0.0170 

0.0218 

0.0248 

0.0304 

0.0366 

0.0460 

0.0543 

0.0703 

0.1 

0.0170 

0.0218 

0.0248 

0.0304 

0.0367 

0.0462 

0.0545 

0.0706 

0.2 

0.0171 

0.0219 

0.0250 

0.0306 

0.0369 

0.0466 

0.0551 

0.0715 

0.3 

0.0172 

0.0221 

0.0252 

0.0309 

0.0373 

0.0471 

0.0558 

0.0726 

0.4 

0.0173 

0.0222 

0.0253 

0.0310 

0.0375 

0.0474 

0.0561 

0.0731 

0.5 

0.0172 

0.0221 

0.0252 

0.0310 

0.0374 

0.0472 

0.0559 

0.0727 

Table  L:  Asymptotic  Points  For  MA(1)  Test  cont. 


p\v 

0.5 

0.75 

0.85 

0.9 

0.95 

0.975 

0.99 

0.995 

0.999 

0.0 

0.1189 

0.2094 

0.2841 

0.3473 

0.4614 

0.5806 

0.7435 

0.8694 

1.1679 

0.1 

0.1198 

0.2118 

0.2878 

0.3522 

0.4684 

0.5898 

0.7557 

0.8839 

1.1879 

0.2 

0.1221 

0.2172 

0.2961 

0.3630 

0.4837 

0.6099 

0.7821 

0.9152 

1.2308 

0.3 

0.1244 

0.2221 

0.3034 

0.3722 

0.4966 

0.6264 

0.8037 

0.9408 

1.2657 

0.4 

0.1253 

0.2238 

0.3057 

0.3751 

0.5004 

0.6313 

0.8100 

0.9481 

1.2755 

0.5 

0.1244 

0.2219 

0.3028 

0.3714 

0.4951 

0.6245 

0.8011 

0.9376 

1.2611 
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Table  2.:  Asymptotic  Points  For  AR(1)  Test 


p\p  0.001 


0.0  0.0170 
0.1  0.0170 
0.2  0.0171 


.01  0.025 


0.0218  0.0248  0.0304 


3.0218 

0.0219 


0.0248 

0.0250 


0.0304 

0.0306 


0.0366 

0.0367 


0.0460  0.0543 


0.0462 


0.0370  0.0467 


0.0545 

0.0552 


0.0703 

0.0706 

0.0717 


0.3  0.0172  0.0221  0.0252  0.0310  0.0375  0.0474  0.0562  0.0734 


0.4  0.0174  0.0224  0.0256  0.0315 


0.5  0.0176 
0.6  0.0178 
0.7  0.0180 
0.8  0.0181 
0.9  0.0182 


0.0227 

0.0230 

0.0233 

0.0235 

0.0236 


0.0259 

0.0263 

0.0267 

0.0269 

0.0271 


0.0320 

0.0326 

0.0330 

0.0334 


0.0382 

0.0389 

0.0396 

0.0402 


0.0484  0.0576  0.0755 


0.0495  0.0590 


0.0506 

0.0514 


0.0603 

0.0614 


0.0407  0.0520  0.0622 


0.0336  0.0409  0.0524  0.0626 


0.0776 

0.0796 

0.0812 

0.0822 

0.0828 


Table  2.:  Asymptotic  Points  For  AR(1)  Test  cont. 


p\p 

0.5 

0.75 

0.85 

0.95 

0.975 

0.99 

0.995 

0.999 

0.0 

0.1189 

0.2094 

0.2841 

0.3473 

0.4614 

0.5806 

0.7424 

0.8694 

1.1679 

0.1 

0.1198 

0.2119 

0.2880 

0.3525 

0.4688 

0.5904 

0.7567 

0.8848 

1.1891 

0.2 

0.1226 

0.2188 

0.2988 

0.3666 

0.4890 

0.6168 

0.7923 

0.9263 

1.2462 

0.3 

0.1267 

0.2288 

0.3140 

0.3864 

0.5169 

0.6533 

0.8393 

0.9831 

1.3242 

0,4 

0.1316 

0.2399 

0.3307 

0.4079 

0.5470 

0.6924 

0.8905 

1.0439 

1.4072 

0.5 

0.1364 

0.2505 

0.3465 

0.4280 

0.5749 

0.7284 

0.9378 

1.0997 

1.4831 

0.6 

0.1406 

0.2595 

0.3596 

0.4447 

0.5980 

0.7581 

0.9765 

1.1453 

1.5469 

0.7 

0.1438 

0.2662 

0.3694 

0.4570 

0.6150 

0.7800 

1.0051 

1.1790 

1.5901 

0.8 

0.1460 

0.2707 

0.3758 

0.4651 

0.6262 

0.7943 

1.0238 

1.2010 

1.6213 

0.9 

0.1472 

0.2731 

0.3792 

0.4695 

0.6321 

0.8020 

1.0337 

1.2127 

1.6388 
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